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ABSTRACT 

The separation of emissions from different astrophysical processes is an important step 
towards the understanding of observational data. This topic of component separation is 
of particular importance in the observation of the relic Cosmic Microwave Background 
Radiation, as performed by the WMAP satellite and the more recent Planck mission, 
launched May 14th, 2009 from Kourou and currently taking data. When performing 
any sort of component separation, some assumptions about the components must 
be used. One assumption that many techniques typically use is knowledge of the 
frequency scaling of one or more components. This assumption may be broken in the 
presence of calibration errors. Here we compare, in the context of imperfect calibration, 
the recovery of a clean map of emission of the Cosmic Microwave Background from 
observational data with two methods: FastICA (which makes no assumption of the 
frequency scaling of the components), and an 'Internal Linear Combination' (ILC), 
which explicitly extracts a component with a given frequency scaling. We find that 
even in the presence of small calibration errors with a Planck-style mission, the ILC 
method can lead to inaccurate CMB reconstruction in the high signal-to-noise regime, 
because of partial cancellation of the CMB emission in the recovered map. While 
there is no indication that the failure of the ILC will translate to other foreground 
cleaning or component separation techniques, we propose that all methods which 
assume knowledge of the frequency scaling of one or more components be careful 
to estimate the effects of calibration errors. 
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1 INTRODUCTION 

Precise observation of the Cosmic Microwave Back- 
ground (CMB), as planned with the Planck space mission 



I Bersanelli & Mandolesi 2000 Lamarre et al. 2000 Tauber 



2004), is of the utmost importance for better understand- 



ing, and confronting with precise observational data, the hot 
big bang model and its theoretical predictions. In this theo- 
retical framework, such observations also permit constrain- 
ing the parameters of the model, as is currently done to a 
lesser extent by a number of previous experiments, such as 
COBE ( |Fixsen et aT)|1997L WMA P ( |Komatsu et al.||2009[|, 

2009[ ), Archeops (Benoit et al 



2003 



Tristram et al. 20051, BOOMERANG 



et al. 



ACBAR (Rcichardt et al 



20061, CBI (Sievers et al. 20091, QUaD (QUaD col 
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laboration: C. Pryke et al. 20081, and VSA ( Rebolo et al 



2004) 



With ever more sensitive instruments, the main source 
of uncertainty in CMB observations, rather than being in- 
strumental noise, is the contamination of the observation 
by foreground emission. Astrophysical foregrounds comprise 
millimeter wave emission from the interstellar medium in our 
own galaxy, as well as emission from compact extragalactic 
sources. 

Component separation methods make use of the differ- 
ent emission laws of different astrophysical components to 
separate them through joint analysis of observations made 



at different wavelengths ( Delabrouille & Cardoso 20071 



Among those methods, the so-called Internal Linear Combi- 
nation (ILC), which makes few assumptions about the phys- 
ical properties of the CMB and the foregrounds, has been 



widely used for the analysis of WMAP data ( Tcgmark et al 
2003) [Eriksen et al.| [2u04l |Delabrouille et al.[ |2009| |Kim| 
et al. 20091. An important assumption of the ILC is that 
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the frequency scaling of the CMB is assumed to be known. 
This is, in principle, a safe assumption, as small tempera- 
ture fluctuations AT of the CMB generate brightness fluctu- 
ations proportional to AT, which scale in frequency like the 
derivative of a blackbody with respect to the temperature, 
at the well measured CMB temperature of T = 2.725 K. 
However, calibration coefficients for each channel, which are 
a multiplicative factor for each frequency, introduce an un- 
certainty in the frequency scalings of the CMB component 
in presence of calibration errors. For space-based missions 
these uncertainties are typically small (well below 1% for 
WMAP or Planck). 

More sophisticated methods for component separation 
have been extensively studied in the community of statis- 
tical signal processing for a variety of applications. These 
methods are part of a field of activity generically desig- 
nated as Blind Source Separation (BSS), or equivalently In- 
dependent Component Analysis (ICA). ICA methods per- 
form separation on the basis of the assumption that each 
of the available observations is a different linear mixture of 
a well defined number of statistically independent compo- 
nents. Such methods generically rely on no prior assumption 
on the scaling coefficients of the components in the differ- 
ent available observations (i.e. on the coefficients of each 
component in the 'mixtures'). In fact, recovering these coef- 
ficients (the so-called 'mixing matrix') is precisely the pri- 
mary target of blind source separation. ICA methods, thus, 
do not typically assume perfect knowledge of the response of 
each channel to the CMB - nor that the CMB contribution 
is the same in all channels. For CMB studies particularly, 
this type of approach has led to the development of a large 



2006 



variety of methods, including CCA ( Bonaldi et al 
Bedini et alT| [2005], FastICA (|Hyvarinen[ |1999^ MICA 
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2003 Cardoso et al. 2008) and CMC A 



I Bobin et al. 2008) . These methods have been used on real 



observational data in a variety of context s (jBonaldi et al 



2007 Maino et al. 2006 Patanchon et al. 2005 1, and com- 



pared extensively on simulated data sets ( Leach et al. 2008 1 



The two main differences between the ILC and ICA 
methods are the following: 

• Whereas ICA is designed to extract the scaling coeffi- 
cients of each of the identified components from the data 
themselves, the ILC assumes perfect knowledge of the scal- 
ing coefficients for the component of interest (CMB); 

• The ILC does not make any assumption about the prop- 
erties of foreground contamination, whereas ICA assumes 
that the data are satisfactorily described by a (noisy) linear 
mixture of independent components. 

Clearly, these methods are bound to be more or less 
adapted to component separation, depending upon the ac- 
tual properties of the data set and on the science objectives 
pursued. In the following we propose to investigate, using re- 
alistic simulations of sky emission and of observational data 
for WMAP and Planck, the relative performance of FastICA 
and ILC in the presence of calibration errors. Such calibra- 
tion errors result in the violation of one of the assumptions 
of the ILC (the prior knowledge of the exact scaling coeffi- 
cients of the CMB in the observations). By contrast, blind 
component separation methods are designed from first prin- 
ciples to estimate the scaling coefficients from the data, and 



in principle should not suffer much from calibration uncer- 
tainties. 

The rest of this paper is organized as follows. In sec- 
tion [2] we describe the ILC and ICA component separation 
methods. We describe our methodology for comparing the 
methods in section [3] In section [4] we present the results 
of our analysis, followed by our conclusions in section [5] We 
also provide a detailed calculation of the effect of calibration 
errors on the ILC in the appendix [A] 



2 ILC AND ICA 

In the following we assume that the available data (maps 
Xi(p) of observed sky) can be written as 



Xi(p) = ais(p) + riiip), 



(1) 



where s(p) is the map of the component of interest (the 
CMB), p indexes pixels in the map, and rii(p) is the contri- 
bution from foregrounds and instrumental noise to the map 
Xi(p). The coefficients at scale the relative amplitude of the 
CMB map in the different available observations. For obser- 
vations in thermodynamic units, and perfect calibration, we 
have Vi, = I. 

2.1 The ILC 

The philosophy behind the ILC is to find the linear combi- 
nation of the available maps Xi which has minimal variance 
while retaining unit response to the CMB map. This linear 
combination, J]\ WiXi(p), is then an estimate s(p) of the true 
CMB map s(p). The ILC weights Wi are found by solving the 
problem of minimizing varj^ i WiXi(p) under the constraint 
Wi — 1. In principle, this last constraint guarantees unit 
response to the CMB, as we have: 



i 



(2) 



In the presence of foregrounds, which induce correlated 
errors from channel to channel, the ILC weights adjust them- 
selves so that the linear combination cancels out as much of 
the foregrounds as possible. The actual weights, however, 
result from a trade-off between canceling foregrounds and 
allowing errors due to instrumental noise in the final map. 

The constrained minimization problem can be solved 
in a straightforward manner using a Lagrange multiplier 
method to impose Yli w i = 1- The resulting weights are 
found to be: 



RT 1 a 



(3) 



where R is the empirical covariance matrix of the observa- 
tions. Note that we have used bold font to denote vectors, 
and have omitted the reference to the pixel value. From here 
on, this notation will be used. The ILC estimator of the 
CMB map s(p) can be written as: 



silc 



a' R- 1 
a* R- 1 a 



(4) 
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The ILC weights, obviously, depend upon the assumed 
scaling coefficients at for the component of interest. It is 
then clear that an error in the assumed scalings changes the 
ILC performance, but by how much? As the ILC attempts 
to minimize the total variance of the output map, the con- 
straint that Widi — 1 plays a critical role in guaranteeing 
that the linear combination does not adjust its coefficients 
to cancel the CMB as well as foregrounds. It is foreseeable, 
then, that calibration errors could, in some cases, impact the 
performance of ILC more severely than just a small overall 
calibration error on the final output map. 



transformation on the data, which sets its covariance matrix 
to the identity by multiplying the data by the inverse square 
root of its covariance. The mixing matrix then becomes a 
simple rotation matrix which, with its smaller number of 
degrees of freedom, is easier to estimate. 

For generating the pre-whitening matrix, we do not 
make direct use of the covariance matrix of the data, as 
with basic FastICA, but instead use the estimated covari- 
ance matrix of the signal as in |Maino et ai] ( |2002[ ). This can 
be understood simply by our modeling of the data (equation 
|6J. Given this data model, the covariance of the observations 



2.2 FastICA 

There is a wide choice of possible ICA methods to extract 
the CMB from multifrequency observations. In this paper, 
we make use of the standard FastICA algorithm as described 



R.T 
R;r 



<(As + 
AR S A* 



n) (As 



in Hyvarinen (19991, with a few minor changes: 



• We subtract an estimate of the instrument noise co- 
variance matrix from the empirical covariance matrix of the 
data. 

• Instead of leaving the estimated signal as being unit 
variance, we set the CMB scaling to be such that the sum 
of the weights is equal to one, mirroring the ILC method to 
ensure unit response to the CMB. 

FastICA is based on the general principle that a sum 
of two different independent probability distributions will 
always tend to be more Gaussian than either of the dis- 
tributions are independently. We can thus extract N inde- 
pendent sources from N channels of data by forming the 
linear combination of the N channels which maximizes the 
non-Gaussianity of the extracted sources. A measure of the 
non-Gaussianity of each source is performed using functions 
such as: 



Y(x)^[E{G(x)}-E{G{y)}} 2 



(5) 



where x is data that has unit variance, and y is a random 
variable drawn from a unit-variance Gaussian distribution. 
Here E{} is the expectation value of the data set or prob- 
ability distribution enclosed and G(x) is some non-linear 
function. Popular choices include a Gaussian, a polynomial, 
or the logarithm of the hyperbolic cosine. Which specific 
choice is best depends upon precisely how the distribution 
of x differs from a Gaussian, though it is clear that for any 
choice of G(x), Y(x) will be zero if x is Gaussian-distributed, 
and positive definite otherwise. In the present paper, we use 
G(x) = x 4 . 

FastICA assumes a model of the data of the form : 



x = As + n, 



(6) 



where now vector s comprises all 'sources' (CMB + fore- 
grounds) , and n is instrumental noise only (for all channels) . 
The objective of the method is to evaluate the mixing matrix 
A, and then use this estimate to invert the linear system. 

In order to optimize estimation of the mixing matrix 
that determines the linear combination of x which repre- 
sents the individual sources, FastICA also performs a pre- 
whitening step. This pre-whitening step exploits the as- 
sumption of statistical independence to perform a linear 



Here the correct covariance matrix to use to whiten the 
signal is AR 3 A*, which we estimate as R^ — R n . The channel- 
channel noise covariance R n is taken as diagonal with the 
diagonal elements estimated from our knowledge of the per- 
pixel noise in each map combined with how much each map 
was smoothed. We have assumed that the signal and noise 
are uncorrelated in the above derivation. 

Having performed the pre-whitening, all extracted 
sources have unit variance and are uncorrelated. To deter- 
mine the overall CMB scaling, we first determine which of 
the sources is the CMB, then use the ILC strategy of setting 
the sum of the CMB weights equal to one. This ensures that 
the level of the CMB in the output is, at least in the case 
of no calibration error, equal to the level of the CMB in the 
maps. 



3 METHOD 

We now turn to the investigation of the impact of cali- 
bration errors on component separation with ILC and Fas- 
tICA. The approach of this investigation consists of generat- 
ing simulated 'observations', with varying calibration errors, 
noise levels, and frequency channels, and compare the per- 
formance of ILC and FastICA at recovering the CMB map. 

Performance is measured in several ways, based on the 
measurement of reconstruction errors of different types. 

Denoting as s(p) the (beam-smoothed) CMB map used 
in the simulation, and as s(p) the CMB map obtained from 
processing the simulated data, the reconstruction error is 
s(p) - s(p). 

This reconstruction error arises from two terms. A mul- 
tiplicative term (i.e. a global calibration error) and an addi- 
tive term. We have 

s(p) = ols(p) + c(p) 

where a is the global calibration coefficient, and c(p) the 
additive contamination by foregrounds and noise. Ideally, 
we aim at a = 1 and c(p) = 0. 

In practice, in both ILC and ICA methods, the final 
map is reconstructed as a linear combination ^2 WiXi (p) of 
the input maps Xi(p). Hence, for simulated data, one can 
compute easily a = ^Wiai and c(p) = ^2 Wini (p)> where 
rii(p) are maps of the sum noise and foregrounds in channel 
i. 

The comparison of the variance of the reconstruction er- 
ror, of the overall response a, and of the contamination c(p) 
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for ILC and ICA gives insight on the relative performance 
of the two, and of the main origin of error, in presence of 
calibration uncertainties. 

3.1 Simulations 

In preparation for the forthcoming Planck space mission, 
simulations for the 9 Planck frequency channels, from 30 to 
857 GHz, as described in the Planck 'BluebookQ are made. 
We also consider simulations in the WMAP frequency chan- 
nels, between 23 and 94 GHz. Sky simulations are performed 
using the Planck Sky Model (PSM) package, version 1.6.^] 
and using the Healpix pixelization. In the simulated obser- 
vations, we introduce a small calibration error, so that each 
of the sky maps is multiplied by a calibration coefficient. 
We consider calibration errors Sa/a of 0.1, 0.2, 0.5, and 1%, 
which implies calibration coefficients typically somewhere 
between 0.99 and l.OlHWe work at the resolution of the low- 
est frequency channel in our simulations, i.e. 33 arcminute 
beams for Planck, and 54 arcminute beams for WMAP. 

Noise compatible with what is expected for the two in- 
struments, for maps smoothed at the resolution of the lowest 
frequency channel, is added to the sky emission. We then 
separate components with both an ILC and with FastICA, 
and analyze and interpret the results. 

3.1.1 Planck Sky Model 

Sky maps are generated using a four-component model of 
galactic emission which includes free-free, synchrotron, ther- 
mal dust, and spinning dust diffuse components. We also 
add emission from several populations of compact sources, 
which comprise ultracompact galactic H-II regions, infra-red 
and radio sources (both galactic and extragalactic), a far 
infrared background emission, and thermal SZ effect from 
a simulated distribution of galaxy clusters. For our Planck 
simulations, maps are generated at 30GHz, 44GHz, 70GHz, 
100GHz, 143GHz, 217GHz, 353GHz, 545GHz, and 857GHz, 
each at nside=1024. For WMAP simulations, maps are gen- 
erated at 23GHz, 33GHz, 41GHz, 61GHz, and 94GHz, each 
at nside=512. Maps are simulated using Gaussian symmet- 
ric beams. Only temperature maps are generated. 

3.1.2 Post-processing of PSM Outputs 

Instrumental noise is added separately after the sky is sim- 
ulated with the PSM. For Planck, we assume uniform sky 
coverage, with noise level corresponding to what is given in 
the Planck 'Bluebook'. Since the FastICA and ILC meth- 
ods require maps that are at the same resolution, we then 
smooth all maps to the resolution of the 30GHz channel, 
which has a Gaussian beam FWHM of 33'. As we use a rel- 
atively low resolution beam, all maps are set to nside=512 
after smoothing. 

After adding noise and smoothing maps to the same 
resolution, we simulate the calibration error by drawing a 

^ http: //www . rssd . esa. int/SA/PLANCK/docs/Bluebook-ESA-SCI (2005) 1_V2 .pdf 

2 

http : //www . ape . univ-paris7 . f r/APC.CS/Recherche/Adamis/PSM/psky-en .php 

3 The calibration error expected for Planck is less than 1% up to 
the 353GHz channel, as given by the Planck 'Bluebook' 




Figure 1. Mask that removes the brightest pixels from the 70GHz 
and 100GHz channels. 

zero-mean Gaussian random variable x with RMS equal to 
the desired calibration error (e.g. a — 0.002 for 0.2% error). 
We then multiply the map by 1 + x. This is repeated for each 
frequency channel, with the same calibration RMS error but 
a different realization of x for each. 

While it makes no difference whether the calibration 
error simulation is performed before or after smoothing, we 
note that it is correct to add the calibration error after the 
noise, as the overall estimated noise level also depends upon 
the calibration of the instrument. As we make use of the esti- 
mated noise covariance between the channels, the estimated 
noise level after smoothing is also computed here. 

3.2 Masking 

For better performance of the FastICA or ILC component 
separation algorithms, it is safer to mask out particularly 
bright sources as well as those with strongly-varying spectral 
properties. The mask is determined making use of a simple 
magnitude-based algorithm. First, we produce a theoretical 
estimate of the expected CMB RMS based upon the WMAP 
power spectrum. We then generate a mask that removes all 
pixels which contain a value larger than four times the CMB 
RMS. 

For our maps, the mask used is a union of the masks 
computed as above from the 70GHz and 100GHz channels. 
We make use of the mask as generated from the first real- 
ization with no calibration error, and do not recompute the 
mask between runs. The resultant mask is shown in fig. [T] 
It is possible that we could obtain better component sepa- 
ration performance through more precise masking, but this 
is not expected to have any impact on the overall results 
of the present paper. The study could have been performed 
with any arbitrary mask, as long as the average CMB to 
foreground ratio is not changed significantly. 

3.3 Monte Carlo 

In order to investigate both the average of the reconstruc- 
tion error and its dispersion, we individually execute each of 
the above steps many times for each chosen set of parame- 
ters, the exact number depending upon the test. Summary 
statistics are then computed across the runs. When com- 
paring different component separation techniques, the exact 
same set of realizations are used. Different choices of the 
calibration error level also make use of the same input sky 
maps. 
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For these simulations, CMB and noise are generated 
from their statistical properties separately in each simu- 
lation. The CMB is a Gaussian realization assuming, for 
all simulations, the same power spectrum, compatible with 
WMAP best fit model, but new phases for each realization. 
Similarly, all realizations of noise are independent. 

Other components are not fully independent from re- 
alization to realization. Galactic components, the model of 
which is heavily constrained by WMAP observations, do not 
change much. The Sunyaev Zel'dovich map is fixed (i.e. the 
same SZ template map is used in all simulations). A frac- 
tion of point sources remain similar (they are based on the 
positions of real sources) although their spectral emission 
law depends on the realization. An additional population of 
point sources, generated to correct for the sky coverage of 
point source surveys to homogenize the point source distri- 
bution, is generated independently for each sky realization. 



FastICA 



0.0% Calibration Error 

0.1% Calibration Error 

0.2% Calibration Error 

— 0.5% Calibration Error 
1 .0% Calibration Error 



4 RESULTS 

In this section we present both analytical and numerical 
results obtained after including the presence of calibration 
errors in the ILC and ICA component separation methods. 
The success or the failure of a method will be evaluated 
as follows. We construct the output CMB map estimates 
by ILC or ICA as well as the residual map, which is the 
difference map between the estimated output CMB map and 
the simulated input CMB map. We compute the RMS value 
of each of these maps and compare them. We also evaluate 
both the multiplicative factor a and the additive error c(p) 
(introduced in section p}]), characterizing the reconstruction 
errors. 



4.1 Compared reconstruction error 

The average root mean square of the reconstruction error 
s — s, over all simulations for the Planck experiment, is com- 
puted in 10 bands of varying galactic latitude. The relative 
error, r = E(s — s)/E(s), for both FastICA and the ILC, 
is plotted in figures [2] and [3] 

As we expected, FastICA is almost completely unaf- 
fected by calibration errors. Because no assumption on the 
relative calibration is used, the overall calibration error just 
adds some small extra variance on the overall level of the 
extracted CMB. 

ILC, however, is not so well behaved as FastICA. While 
ILC is somewhat better than FastICA at extracting the 
CMB when calibration is perfect, it quickly becomes worse 
as calibration errors of increasing magnitude are applied. 
Fig. [4] shows the output of a particular realization at 1% 
calibration error where ILC performed especially poorly, 
compared with the input CMB plotted on the same scale. 
The variance of the ILC output is much lower than the true 
CMB, and CMB features are strongly suppressed. As ILC 
attempts to find the minimum- variance output, it finds that 
with calibration errors it is possible to partially cancel the 
CMB to get the lowest possible variance output. 



Figure 2. Plot of the relative error of FastICA as a function of 
galactic latitude. Generated using 128 simulations for each case. 
As expected, the relative error of FastICA has very little depen- 
dence upon the calibration error. 



ILC 



s 

°!° 0.6 



0.0% Calibration Error 
0.1% Calibration Error 
0.2% Calibration Error 
0.5% Calibration Error 
1.0% Calibration Error 



Figure 3. Plot of the relative error of ILC as a function of galac- 
tic latitude. Generated using 128 simulations for each case. Unlike 
FastICA, ILC shows tremendous sensitivity to the calibration er- 
ror, causing a noticeable reduction in the quality of the extraction 
of the CMB even at the optimistic 0.1% calibration error level. 



4.2 Interpretation of the ILC failure 

The impact of calibration errors on ILC weights, and on 
the output CMB map, is analytically explored in Appendix 
[A] Here we highlight that the signal-to-noise ratio plays a 
decisive role on this impact. 

The ILC method is a linear combination of the maps 
observed in different frequency channels, s = WiXi. The 
ILC combination has minimum variance under the con- 
straint 



(8) 
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Figure 4. Input CMB and ILC-estimated CMB plotted on a 
0.2mK scale for one realization at 1% calibration error with par- 
ticularly bad output (relative error near 1.0). Note that the vari- 
ance of the ILC output is far below the input CMB, indicating 
that the input CMB was largely canceled. 



The constraint in principle guarantees the CMB con- 
servation, otherwise Wi = for all i would minimize the 
variance. If the calibration <n is wrong then the CMB con- 
servation is no longer guaranteed. In some cases, when the 
signal-to-noise ratio is large enough, it can be dramatic for 



the CMB extraction (see section 4.3 1. 

As discussed above, the reconstruction error arises from 
two terms. A multiplicative term, i.e. a global calibration 
error term, and an additive contamination term. We can 
write the estimated CMB map as a function of the true 
CMB map as: 

s(p) = as(p) + c(p), 

where a is the global calibration coefficient, and c(p) the 
contamination by foregrounds and noise. Figure|5]shows this 
parameter a versus the input map calibration error. The 
presence of calibration errors S ai modifies the calibration 
coefficients in each channel as a 4 — > a,i + 5 ai , where 8 ai <C a.i- 
We may explicitely expand the multiplicative error and the 
additive error in terms of the calibration errors S ai and the 
ILC weights Wt: 

s(p) = ^WiXiip) 

i 

= m (en + 8 ai )s(p)+ (p) 



Figure 5. This figure shows the overall calibration coefficient 
of the output CMB map, computed from the known calibration 
errors in the inputs and the weights applied to obtain the output. 
The error bars represent the RMS of a among the 128 realizations. 
For FastICA, the calibration coefficient is centered very near one, 
with an RMS of approximately 1.5 times the map calibration 
error. By contrast, ILC has a CMB calibration that is perfect 
if the map calibration is perfect, but this quickly turns into a 
significant bias with large uncertainties as to the final calibration 
value. 



where the ILC weights u>i satisfy the constraint £\ Wi<M = 1. 
Thus we have 



i 

a = 1 + w i$°> 



(10) 
(11) 



The additive error term c(p) = ^2 i WiUi(p) is responsi- 
ble for a bias in the CMB estimation because of foreground 
and noise contaminations even in the absence of calibration 
errors. Delabrouille et al. (2009) have explored the impact 
of this term on the ILC estimation of the CMB and have 
found that, in addition to the standard reconstruction error 
due to foreground and noise contamination, there is a bias, 
E (s ■ (s — s)), due to the estimation of second order statis- 
tics on samples of finite size. Both errors contribute to the 
variance of the output CMB map as 

E (I 2 ) = E (s 2 ) +E({§- sf) + 2E[s-{s- a)) . 

The multiplicative error term a = 1 + J^. WiSi becomes 
non trivial in presence of calibration errors because the ILC 
weights Wi, as derived in equation (A6l of Appendix |A] do 
not depend only on the calibration errors 5 ai but also on 
the signal-to-noise ratio cr 2 /?" 1 ^-, where a 2 = E(s 2 ) and 
(Rn)ij = EijiiTij) denote respectively the variance of the 
CMB signal and the covariance matrix of the noise (includ- 
ing foregrounds). 



4.3 Importance of the signal to noise ratio 



*(p). 



(9) 



From the exact expression (|A6j) of the weights we may write 
the multiplicative factor a — 1 + w 8 a as 
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a'R„ 1 a + a'R,, 1 ^ 



(a'Rn 1 a)(5 t a R^ 1 a a ) - (a*R„ <5 a ) 2 



The immediate consequence of equation ( 12 1 is the existence 
of two regimes. 

If the signal-to-noise ratio is small enough compared to 
the inverse of the calibration error, typically, 

if ^SlR^Sa < 1 then a « 1 + 0(\d a \/\a\), 

because the expression proportional to a 2 becomes negligible 



was correctly estimated. Consequently, the CMB estimate 
is of minimum variance since E(s 2 ) « 0, but of course com- 
(12) pletely removes the expected input CMB, rendering the ILC 
totally irrelevant. 

We may explain the process as follows. In presence of a 
calibration error in one channel the ILC algorithm minimizes 
the variance of 



in ( 12 1. So we tend to recover the almost perfect CMB recon- 



struction close to the case of no calibration error (S a = 0). 

If the signal-to-noise ratio becomes large enough then 
the reconstruction of the CMB signal may be dramatically 
damaged. This is the main result of this paper. Typically, 

if CT 2 <5^ l R^ 1 (5 a ^> 1 then a « 0, 

the multiplicative factor goes to zero since the expression 
proportional to a 2 dominates all the other terms in (121, 



in which case the ILC estimation completely "kills" the ex- 
pected CMB signal, s(p) w c(p). 

Let us complete the discussion by relating the first and 
second moments of the output CMB s and the reconstruc- 
tion error s — s to the multiplicative and the additive errors. 
Considering that the CMB and the noise (including fore- 
grounds) are independent random signals, E(riis) = 0, and 
assuming that E(rii) — 0, we get 



E(s 



= (a-l)E(s) 



E(s-(s-s)) = (a-l)E(s J ), 
E ((S - sf) 



= {a-l) 2 E{s 2 ) + E(c<-^ 2 



(Pf) , 



(13) 



where E (c(p) 2 ) = w t R n w. The detailed expression of these 
moments in terms of the calibration errors and the signal-to- 
noise ratio is derived in Appendix [A| From (131, once again, 
if the signal-to-noise is large enough then the reconstruction 
of the CMB is biased since a moves away from one to reach 
zero. 



4.4 A simple example 

Here we show a schematic description of the process using 
a simple example. We consider a two-channel case: 



X2 



0.99s + 7ii 
s + n 2 . 



(14) 



Here s is the CMB, Xi is the i* channel of the data, and 
m is the foregrounds plus instrument noise. The calibration 
coefficients are equal to one and a calibration error of one 
percent has been considered in the first channel. If the signal- 
to-noise ratio is large enough, e.g m/s <C 0.99, then the 
noise is negligible in the observed maps 



xi w 0.99s 

X2 W S. 



(15) 



The ILC estimate of the CMB thus reduces in that case to 

§ w lOOzi - 99x 2 , (16) 

where the weights satisfy the constraint 100 — 99 = 1, which 
would guarantee the CMB conservation if the calibration 



S = (0.99uil + I02)s + tUlTll + W2U2. 



(17) 



We can contrast this what we would get without calibration 
errors, 



= (Wl + 102)5 + TOirti + W2T12. 



(18) 



With the constraint that 101+102 = 1, the contribution of the 
CMB signal to s is always s. This indicates that the weights 
will take whatever values they need to take to minimize the 
contribution of the noise. 

However, in the presence of calibration errors, it be- 
comes possible for the contribution of s to s to vary de- 
pending upon the choice of weights, indicating that a min- 
imization of the variance of s will introduce some compe- 
tition between minimizing (0.99toi + to 2 )s and minimizing 
wini + W2ri2. For the following weights 



Wl = 
102 



inn. 

-99, 



(19) 



the contribution of the CMB to s will be identically zero. 
This is what the ILC produces in the limit of the signal-to- 
noise ratio becoming very large with respect to the calibra- 
tion error. In the opposite limit, that of small signal-to-noise 
ratio, it is the minimization of the second term, iuini+u; 2 n 2 , 
that drives the minimization of s, which mimics the behavior 
under the assumption of no calibration error. 



4.5 The case of Planck 

In table[l]we present the results often simulations of the sky 
with an ILC estimation of the CMB in presence of 0.1 %, 
0.5 % and 1 % calibration errors for the Planck experiment 
(9 frequency channels). 

For 1 % we observe a significant bias affecting the 
CMB reconstruction by ILC. The multiplicative factor a — 
0.665 (table [T]) indicates that the CMB estimate eliminates 
roughly 33 % of the input CMB. The high sensitivity of 
Planck means a large signal-to-noise ratio, comparable to 
the inverse of the calibration error, which leads to a poorly 
extraction of the CMB by ILC, as expected from the formula 
( 12 1. For 0.5 % calibration errors, 15 % of CMB is eliminated 
by the ILC estimation. Finally for 0.1 % calibration errors, 
1 % of CMB is eliminated by the ILC estimation, which is 
nevertheless ten times the calibration error - and clearly not 
acceptable for precision cosmology with Planck. 



4.6 The case of WMAP 

In table[2]we present the results often simulations of the sky 
with an ILC estimation of the CMB in presence of 1 % cali- 
bration errors for the WMAP experiment. We observe a neg- 
ligible bias affecting the CMB reconstruction by ILC. The 
multiplicative factor a m 0.99 (table [5| indicates that the 
percentage of eliminated input CMB by ILC is for WMAP 
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[htbp] 



Planck 


1 % 


0.5 % 


0.1 % 


mult, factor ct 


0.66500 


0.85258 


0.99237 


add. error E(c(p)) (mK) 


6.208c-2 


3.455c-2 


1.845e-2 


E(c(p) 2 ) (mK 2 ) 


1.231c-2 


3.46c-3 


5.6e-4 


E((s-s) 2 ) (mK 2 ) 


4.26c-3 


1.91c-3 


5.5e-4 


E(s ■ (1 - «)) (mK 2 ) 


-3.26e-3 


-1.18c-3 


-1.3c-4 


E(s 2 ) (mK 2 ) 


7.42e-3 


7.42c-3 


7.42c-3 


E(s 2 ) (mK 2 ) 


5.16e-3 


6.99c-3 


7.71e-3 



Table 1. ILC reconstruction errors for Planck in presence of 1 %, 0.5 % and 0.1 % calibration errors. 



[htbp] 



WMAP 


1 % 


mult, factor a 


0.98709 


add. error E(c(p)) (mK) 


1.129e-2 


E{c{p) 2 ) (mK 2 ) 


6.9e-4 


E((s- s) 2 ) (mK 2 ) 


6.5e-4 


E(s- (s- s)) (mK 2 ) 


-2.4e-4 


E(s 2 ) (mK 2 ) 


5.15e-3 


E(s 2 ) (mK 2 ) 


5.33c-3 



Table 2. ILC reconstruction errors for WMAP in presence of 1 % 
calibration errors. 



of order of the calibration error, i.e 1 %, as expected from 



formula ( 12 1 when the signal-to-noise ratio is small enough. 
The sensitivity of WMAP is small enough to render the ILC 
estimation of the CMB insensitive to calibration errors. 



4.7 Actual WMAP ILC 

The above result for WMAP was obtained assuming that the 
ILC is performed on the masked sky of figure[l] In fact, ILC 
weights used by the WMAP team have been computed in 
a different way, by subdividing the sky into twelve regions. 
Since the value of their weights are known, as well as the 
mean calibration error, we may easily evaluate the error of 
the reconstruction performed by the WMAP team. 

The order of magnitude of the ILC weights w™ MAP 
computed by the WMAP team is comprised between 1CP 2 



and 3 ( Hinshaw et al. 2007 1 and the relative calibration er- 
rors have been estimated by the WMAP team to be of the 
order of 5 a ,i ~ 0.2%. 

In the subdivision of the sky by the WMAP team the 



region zero ( Hinshaw et al. 2007 1 corresponds to the part of 



the sky outside the galaxy and thus dominated by the CMB 
signal. A priori, since the signal-to-noise ratio is the highest 
in that high galactic latitude region, one might expect the 



effect of the calibration errors to be large. This is not the 
case, however. 

We may estimate the maximum percentage of elimi- 
nated CMB in the region zero as follows: 



II 



WMAP j 
Wi O a , 



E 

i 

0.002^ 



sC 7-10" 



0.7%, 



(20) 



where wf ' MAP are the IL C weights computed by the WMAP 
team in the region zero (Hinshaw et al. 20071. Therefore, 



the maximum percentage of eliminated CMB has the order 
of magnitude of the calibration error, i.e fewxlO" 1 %, which 
is small|^]So the multiplicative factor for the actual WMAP 
ILC in presence of 0.2 % calibration errors is close to one, 
with a minor loss of CMB power: 

a > 0.993. 

Interestingly, the ILC weights used at high galactic lat- 



itude by the WMAP team (Hinshaw et al. 2007) have been 



computed in a low galactic latitude region of the sky, where 
the signal-to-noise ratio is sufficiently small. This certainly 
explains why the ILC weights are close to those expected 
with no calibration errors and why the multiplicative factor 
is close to one. Therefore the calibration uncertainties do not 
have a strong impact on the ILC weights computed in the 
WMAP third year data release. The bias due to calibration 
errors is negligible. 

The price paid for this, as emphasized by |Delabrouille| 
|et al.| ( |2009[ ), is that at high galactic latitude the WMAP 
weights are chosen to cancel galactic foregrounds rather 
than instrumental noise, a sub-optimal choice away from 
the galaxy, particularly for small scales. 

4.8 Other ILC performed on WMAP 

Several authors have used a version of the ILC to analyze 
WMAP data. The present paper warns the users of the cor- 

4 It should be noticed that this bound is a rough estimation since 
we do not have access to the real value of the calibration error for 
each frequency channel. 
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responding data sets that in presence of calibration errors, 
some CMB power may be lost in the maps obtained. Further 
investigation would be needed to evaluate the exact impact 
for each individual recovered CMB map. 



4.9 De— biasing 

A natural question to ask is whether, since the effect of cali- 
bration errors is to introduce a loss of CMB power, it would 
not be possible to correct from this effect and 'recalibrate' 
a posteriori in some way. 

First of all, this can not be the optimal solution, as 
the noise contribution to the total error would be increased 
accordingly. The proper solution would be to get the right 
calibration beforehand. As we can see from figure [5] that the 
variance of a seems to be of order 1 — a. This indicates that 
the maximum improvement on the level of the CMB is to 
reduce the expectation value of |1 — a by around a factor of 
two. As 1 — a becomes very large very quickly, this will not 
help when the calibration is not already very good compared 
to the signal to noise ratio. 

Finally, even the knowledge of the expectation value of 
a is not very easy to get. Simulations give an estimate of its 
amplitude, but the actual value may depend on details, for 
which simulations are not guaranteed to be representative. 

Hence, we leave this question open for further investi- 
gations. 



4.10 Impact of the number of channels 

Tests performed varying the number of channels used to per- 
form the ILC with Planck data show that ILC does better 
with calibration errors if fewer channels are used. To add 
new data, but end up with worse estimation of the desired 
products, indicates that the new data is not being used ef- 
fectively, to say the least. 

The reason for this degradation of the performance of 
the ILC when more channels are added is easy to under- 
stand. As discussed in section |4~3| the ILC can erroneously 
cancel out part of the CMB if the signal to noise ratio is 
larger than the inverse of the calibration error, i.e. if 



^SiR^Sa » 1. 



(21) 



As R n 1 and R n are symmetric matrices, they can be diago- 
nalized, and we can write: 

R,; 1 = O'D^O, 

where O is an orthonormal matrix, and D^" a diagonal ma- 



trix. The condition of eq. (211 then becomes 



^(OSafD-^OSa) » 1. 



(22) 



Matrix O preserves the norm, and thus elements of OS a are 
of the same order as those of S a - It then suffices that one of 
the eigenvalues of D„ be small for a 2 5aR^" 1 5 a to be large, 
causing the CMB power loss discussed in this paper. 

Now recalling that R n is the covariance matrix of noise 
+ foregrounds, it is easy to understand why more channels 
cause more problems with Planck. Foregrounds are signifi- 
cantly brighter than the noise, and comparable in amplitude 
to the CMB over a fraction of the sky. If they span a space 
of dimension equal or greater than the number of channels, 



matrix D n will have no small eigenvalue. If on the other 
hand they span a space of dimension less than the number 
of channels, matrix D n will have at least one small eigen- 
value, generating the 'CMB loss' problem. 

Physically, this is understood in the following way: if 
there are few channels, the minimization of the variance of 
the ILC linear combination will be achieved by canceling 
foregrounds primarily. If however there are additional chan- 
nels which are not needed to cancel out the foregrounds, the 
extra channels leave more freedom for the ILC weights to 
adjust themselves so as to cancel part of the CMB as well. 



5 CONCLUSION 

The primary conclusion of our work is that some care is re- 
quired for performing component separation in presence of 
calibration errors, in particular for sensitive multichannel in- 
struments such as Planck. We have shown that two different 
component separation algorithms, FastICA and ILC, behave 
very differently in the presence of calibration errors. FastICA 
is completely unaffected, while ILC can become biased by 
a significant amount with even small calibration errors. We 
propose that those attempting to make use of these or other 
component separation techniques pay close attention to how 
calibration errors affect their results. Some techniques will 
doubtlessly be completely unaffected, as FastICA was, while 
others may be very sensitive like ILC. 

We also note that due to the fact that ILC in the pres- 
ence of sufficient calibration errors biases the variance of the 
CMB low, and because we have a lower limit upon the vari- 
ance of the CMB from WMAP, through its measurement of 
the CMB power spectrum up to about I — 900, the variance 
of the ILC output may prove a useful diagnostic test if the 
calibration of Planck was performed well. The ability to use 
this as a cross-check on calibration also indicates that for a 
Planck-style mission we expect to recover, at a minimum, 
around 0.1%-0.2% relative calibration error. The reasoning 
for this is that if the calibration error is worse, then ILC 
will produce a CMB map that is of lower-variance than a 
similar map from WMAP, which, in turn, tells us that the 
calibration wasn't very good. If we have information that 
the calibration wasn't as good as it could have been, then 
it is reasonable to expect that it is possible to improve said 
calibration. 

Note that even though FastICA is not biased where 
ILC is, it is not clear that FastICA is better. ILC does seem 
to produce lower errors in extracting the CMB, as seen in 
figures [2] and [3] The biasing is troubling, but ILC retains 
lower extraction error up to somewhere between 0.1% and 
0.2% calibration error, at least at high galactic latitudes. 
If the calibration error is good enough, then we still expect 
ILC to remain a very useful method for extracting the CMB. 
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APPENDIX A: ANALYTIC ANALYSIS OF ILC 

In this appendix we analytically derive the bias of the ILC estimator generated by calibration errors, and look at the impact 
of the signal-to-noise ratio on this bias. In particular we show that, even in presence of small calibration errors, the ILC tends 
to poorly extract the CMB if the signal-to-noise ratio is large. 



Review of ILC 

We model the data as: 



;(p) = a s(p) + n(p), 



(Al) 



where x(p) is the vector of the observed data at the pixel (or harmonic mode, or needlet coefficient) p for the set of frequency 
channels, s(p) is the CMB signal, and n(p) is the vector of corresponding noise (including both foregrounds contaminants and 
instrumental noise) for the set of frequency channels, a is a vector which contains the frequency scaling of the component, so 
that in the case of CMB with no calibration errors one has a — (1, 1, 1). In the following we omit the index p. 

We use a 2 and R„ respectively to denote the variance of the CMB signal s and the covariance matrix of the noise n 
(including foregrounds) , and we assume that s and n are independent such that the covariance matrix of the observed maps 
is R x = (R n + <r 2 aa i ). The ILC implements an approximation of the ideal filter 



a R x 



a l R x a 



(A2) 



which is an unbiased minimum variance estimate of s. In practice, the covariance used in the ILC is an empirical estimate 
on a sample of finite size, and t hus slightly differs from i ts ensemble average (R x = R x + A x ). This induces a bias in the 
variance of the ILC, as shown by Delabrouille et al. (20091. In this appendix we assume R x = R^ to investigate the bias that 



stems from errors in calibration alone (d = a + 5 a ) 



Bias due to calibration errors 

An imperfectly estimated frequency scaling vector a includes calibration errors S a which introduce a discrepancy between the 
observed data and the ILC weights that are used to reconstruct the CMB. This discrepancy is expected to be responsible for 
a bias in the reconstruction. Due to calibration errors S a the observed map is modified as 

x = (a + S a ) s + n, 

R x = R n + a 2 (a + S a )(a + S a )\ (A3) 
such that the ILC estimate become^f] 

„ a' [R„ + (7 2 (a + <5 a )(a + <5 a ) t ]~ 1 rx 

1 - ((a + <5 a ) s + n) . (A4) 



a* [R„ +a 2 {a + S a )(a + S a yy 
Making use of the inversion formula: 

tj-1 rn , 2/ . c w ,c Nil- 1 r>-i 2 R„ 1 (a + 5 a )(a + <5 a )*R n 1 
R x = [R n + a (a + S a )(a + S a ) =R n - a — — y - — w P -i, , n . 

1 + cH(a + OaYRn (a + da) 

we obtain the weights w of the ILC (s = w t x) as 

t a'R" 1 + <r 2 a*R~ 1 (a*R~ 1 5 a + <5* R^ 1 (5 a ) — cr 2 J„R~ 1 (a'R~ 1 a + a t R~ 1 5 a ) 
w = — 

In the absence of calibration errors (S a — 0) we recover the standard ILC weights: 



(A5) 



(A6) 



a l R n a a l tl x a 



where the second equalitjQ comes from the inversion formula ( A5 I . We see that the presence of calibration errors induces a 



departure from the standard ILC weights through correction terms which explicitly depend on the signal-to-noise ratio ct 2 R^ 1 . 
Typically if the signal-to-noise ratio is much smaller than the inverse of the calibration error squared (e.g 8 a a 2 R^ 1 8 a <C 1) 
then the calibration errors will have little impact and the standard ILC weights are relevant. Else if the signal-to-noise ratio 
becomes comparable to the inverse of the calibration error squared then the impact of calibration errors may be more dramatic 



5 It should be noted that the derivation would have been of course completely equivalent if we had considered calibration errors into the 
ILC weights instead of into the data. 

6 In practice we have only access to the covariance matrix of the observed maps but not to the noise covariance matrix R n (including 
foregrounds) for constructing the ILC estimate. But theoretically the both representations of the ILC estimate are identical. 
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on the CMB reconstruction since the variance of the ILC may be much lower than the true CMB. In the exact expression ( A6 1 
we intentionally conserved second order terms in S a because they play a role of regularization terms when the signal-to-noise 
ratio cr 2 R^" 1 goes to infinity. 

As a simple illustration let us apply the above result to the following example with two frequency channels and a diagonal 
noise covariance matrix: 

xi = 0.99s + 7ii 

x 2 = s + n 2 . (A8) 
Here the calibration error is one percent. We note a 2 = E(s 2 ), R n = diag[cr 2 , erf] , where of = E(n 2 ). In this example 



a* = (1,1) and <5 a = (—0.01,0) so that the expression (A6l of io* = (wi,W2) reduces to 



Wl = / Sal (A9) 
^+S 2 al 

£ + (l + S al )S al 

r °al 

If the signal-to-noise ratio becomes very large (i.e a 2 /a 2 — » 0) then Wi « —l/5 a i = 100 and W2 ~ (1 + S a i)/S a i = —99, 
so that the output CMB vanishes s « lOtei — 99x2 ~ 100ni — 99n2 ~ when the noise is negligible. If the signal-to-noise 
ratio becomes very small then Wj ~ erf /(of + a 2 ) and u>2 ~ a 2 /(a 2 + o^), which is the standard least mean square solution. 

The ILC estimate is given by 



s — iu'(a + 6 a )s + lo'n, 

a'R.^ 1 a + a*R^ <S a 



s + io*n, (All) 



atR-^ + a 2 (a'R.- 1 a)((5iR.- 1 (5 a ) - o-^a'R" 1 ^ 
We assume E(n) = 0, the ILC estimate is thus biased as 

E(S) = = a'R^a + a'R^Sa 

We see that in the limit of small signal-to-noise ratio, a 2 (R^ 1 ) <C 1, the bias is of order of magnitude of the calibration 
error: E(s)a t R~ 1 S a /a t R~ 1 a. Whereas if a 2 (R^ 1 ) 2> 1 then the bias is accentuated since E(s) — > 0. 
Let us compute the mean value of the error d = § — s (assuming E(n) — 0): 

£(d) = E(w*(a + 6 a )s- s) 

a'R-^g - a 2 [(a^aX^R-X) - (^R'X) 2 ] 
atR-'a + a 2 [(a t R- 1 a)(<J t a R- 1 <5 a )-(a t R- 1 5 a ) 2 ] [Sh 

so that E(d) -E(s) when a 2 (R^ 1 ) j . > 1, and i5(d) -> when a 2 (Rn 1 ) i . < 1 and <5 a < a. 

In the same way the cross correlation E(s d) of the error of the reconstruction with the CMB signal 

„, a'R" 1 ^ — \{a t Rn 1 a)(6 t a R~ 1 6 a ) — (a t R~ 1 S a ) 2 ] 2 , . , ^ 

_E(s d = 5 p 5 7 5 ! f- er , (A14) 

v ; a^R-^ + a 2 [(aiR n 1 a)(<5 t a R,T 1 <5 a )-(a t R- 1 5 [l ) 2 ] V ; 

may vanish only if a 2 (R^ 1 )^. <C 1. If the signal-to-noise ratio becomes large enough then E(s d) — ► —a 2 , giving evidence of 
the cancellation of the CMB output. 

We may also compute the variance of the error. Since the CMB and the noise are uncorrelated we get 

E(d 2 ) = (w l 8 a) 2 a 2 + w t R n w 



a t R n 1 8a — u 2 [(a t R n 1 a)(8 t a R n 1 8 a ) — (a t R n 1 S a 
a'K'a + a 2 [(o*R^ 1 a)((S t R^ 1 «5 ) - (a'R^X) 



2 

2 



| a'R^a + a 2 (2 + gVR^a) [(a^aX^R^ ^a) - (a'R- 1 S a ) 2 ] 
(a^R-V + a 2 [(a^aX^R" 1 ^) - (a'R- 1 J a ) 2 ] ) 2 
Notice that the second term has been truncated at second order in <5 a . If a 2 (R^ 1 )^. <C 1 then we recover the standard 



reconstruction error with no calibration error: E{d 2 ) « l/(a'R" 1 a), as computed by Delabrouille et al. «2009b. If the 
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signal-to-noise ratio becomes large enough then w t R n w becomes negligible compared to the first term (w t S a ) a 2 such that 
E(d 2 ) w a 2 , again giving evidence of the cancellation of the CMB output. 
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